Accurate, Large-Scale and Affordable Hybrid-PBE0 Calculations with GPU-Accelerated Supercomputers

نویسندگان

  • Laura E. Ratcliff
  • A. Degomme
  • José A. Flores-Livas
  • Stefan Goedecker
  • Luigi Genovese
چکیده

Performing high accuracy hybrid functional calculations for condensed matter systems containing a large number of atoms is at present computationally very demanding – when not out of reach – if high quality basis sets are used. We present a highly efficient multiple GPU implementation of the exact exchange operator which allows hybrid functional density-functional theory calculations with systematic basis sets without additional approximations for up to a thousand atoms. This method is implemented in a portable real-spacebased algorithm, released as an open-source package. With such a framework hybrid DFT calculations of high quality become accessible on state-of-the-art supercomputers within a time-to-solution of the same order of magnitude as traditional semilocal-GGA functionals. ar X iv :1 71 2. 07 97 3v 1 [ co nd -m at .m tr lsc i] 2 1 D ec 2 01 7 Accurate, Large-Scale and Affordable Hybrid-PBE0 Calculations with GPU-Accelerated Supercomputers 2

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Gauge Field Generation on Large-Scale GPU-Enabled Systems

Over the past years GPUs have been successfully applied to the task of inverting the fermion matrix in lattice QCD calculations. Even strong scaling to capability-level supercomputers, corresponding to O(100) GPUs or more has been achieved. However strong scaling a whole gauge field generation algorithm to this regim requires significantly more functionality than just having the matrix inverter...

متن کامل

A CUDA Implementation of the High Performance Conjugate Gradient Benchmark

The High Performance Conjugate Gradient (HPCG) benchmark has been recently proposed as a complement to the High Performance Linpack (HPL) benchmark currently used to rank supercomputers in the Top500 list. This new benchmark solves a large sparse linear system using a multigrid preconditioned conjugate gradient (PCG) algorithm. The PCG algorithm contains the computational and communication patt...

متن کامل

Accurate predictions of the EPR parameters in planar cobalt(II) complexes by hybrid density functional theory

The hybrid density functional theory is applied to calculate the electron paramagnetic resonance parameters, i.e, the gand A-tensors of some planar Cobalt(II) complexes with a C2v symmetry. Calculations were done for four systems: Co(acacen), Co(tacacen), Co(seacacen) and Co(sacsac)2. The following hybrid functionals were employed: B3LYP, B3PW91, mPW1PW91 and PBE0. The expected large deviation ...

متن کامل

Compiler-based code generation and autotuning for geometric multigrid on GPU-accelerated supercomputers

GPUs, with their high bandwidths and computational capabilities are an increasingly popular target for scientific computing. Unfortunately, to date, harnessing the power of the GPU has required use of a GPU-specific programming model like CUDA, OpenCL, or OpenACC. As such, in order to deliver portability across CPU-based and GPU-accelerated supercomputers, programmers are forced to write and ma...

متن کامل

Direct numerical simulation of turbulence using GPU accelerated supercomputers

Direct numerical simulations of turbulence are optimized for up to 192 graphics processors. The results from two large GPU clusters are compared to the performance of corresponding CPU clusters. A number of important algorithm changes are necessary to access the full computational power of graphics processors and these adaptations are discussed. It is shown that the handling of subdomain commun...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2017